Voice recognition software accuracy with second language speakers of English
Identifieur interne : 001E69 ( Main/Exploration ); précédent : 001E68; suivant : 001E70Voice recognition software accuracy with second language speakers of English
Auteurs : D. Coniam [Hong Kong]Source :
- System [ 0346-251X ] ; 1998.
Abstract
This paper explores the potential of the use of voice recognition technology with second language speakers of English. The study is a development of an earlier study conducted with a small group of native speakers (Coniam, 1998a, TEXT Technology 8.). The current study involves the analysis of the output produced by a small group of very competent second language subjects reading a text into the voice recognition software Dragon Systems ‘Dragon Naturally Speaking’. As the program is speaker-dependent and has to be trained to recognise each person's voice, subjects first spent about 45 minutes reading a training text of some 3800 words. As the test text, they then read a second text consisting of 1050 words. The output produced by the software was analysed in terms of words, sub-clausal units, clauses and t-units. In terms of accuracy, the second language speakers' output on each category of analysis was significantly lower than that achieved by the native speakers. Nonetheless, the results were consistent in line with the native speakers' scores; i.e. that the highest accuracy scores were achieved at the lowest (and most discrete) level of analysis, the word level, and the lowest scores at the t-unit, or sentence level of analysis. The paper concludes that voice recognition technology is still an at early stage of development in terms of accuracy and single-speaker dependency. Nonetheless, the fact that consistent results have emerged suggests that the development of an assessment tool, such as a reading aloud test via voice recognition technology and determining a score through an analysis of the output, may be a testing procedure with potential.
Url:
DOI: 10.1016/S0346-251X(98)00049-9
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 001226
- to stream Istex, to step Curation: 001152
- to stream Istex, to step Checkpoint: 001420
- to stream Main, to step Merge: 001F78
- to stream Main, to step Curation: 001E69
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>Voice recognition software accuracy with second language speakers of English</title>
<author><name sortKey="Coniam, D" sort="Coniam, D" uniqKey="Coniam D" first="D." last="Coniam">D. Coniam</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:9E60D439B771A434534991EDCEAEA54E8210303B</idno>
<date when="1999" year="1999">1999</date>
<idno type="doi">10.1016/S0346-251X(98)00049-9</idno>
<idno type="url">https://api.istex.fr/document/9E60D439B771A434534991EDCEAEA54E8210303B/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001226</idno>
<idno type="wicri:Area/Istex/Curation">001152</idno>
<idno type="wicri:Area/Istex/Checkpoint">001420</idno>
<idno type="wicri:doubleKey">0346-251X:1999:Coniam D:voice:recognition:software</idno>
<idno type="wicri:Area/Main/Merge">001F78</idno>
<idno type="wicri:Area/Main/Curation">001E69</idno>
<idno type="wicri:Area/Main/Exploration">001E69</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">Voice recognition software accuracy with second language speakers of English</title>
<author><name sortKey="Coniam, D" sort="Coniam, D" uniqKey="Coniam D" first="D." last="Coniam">D. Coniam</name>
<affiliation wicri:level="4"><country xml:lang="fr">Hong Kong</country>
<wicri:regionArea>Faculty of Education, The Chinese University of Hong Kong, Sha Tin</wicri:regionArea>
<placeName><settlement type="city">Sha Tin</settlement>
<region type="Région">Hong Kong</region>
</placeName>
<orgName type="university">Université chinoise de Hong Kong</orgName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">System</title>
<title level="j" type="abbrev">SYS</title>
<idno type="ISSN">0346-251X</idno>
<imprint><publisher>ELSEVIER</publisher>
<date type="published" when="1998">1998</date>
<biblScope unit="volume">27</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="49">49</biblScope>
<biblScope unit="page" to="64">64</biblScope>
</imprint>
<idno type="ISSN">0346-251X</idno>
</series>
<idno type="istex">9E60D439B771A434534991EDCEAEA54E8210303B</idno>
<idno type="DOI">10.1016/S0346-251X(98)00049-9</idno>
<idno type="PII">S0346-251X(98)00049-9</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0346-251X</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">This paper explores the potential of the use of voice recognition technology with second language speakers of English. The study is a development of an earlier study conducted with a small group of native speakers (Coniam, 1998a, TEXT Technology 8.). The current study involves the analysis of the output produced by a small group of very competent second language subjects reading a text into the voice recognition software Dragon Systems ‘Dragon Naturally Speaking’. As the program is speaker-dependent and has to be trained to recognise each person's voice, subjects first spent about 45 minutes reading a training text of some 3800 words. As the test text, they then read a second text consisting of 1050 words. The output produced by the software was analysed in terms of words, sub-clausal units, clauses and t-units. In terms of accuracy, the second language speakers' output on each category of analysis was significantly lower than that achieved by the native speakers. Nonetheless, the results were consistent in line with the native speakers' scores; i.e. that the highest accuracy scores were achieved at the lowest (and most discrete) level of analysis, the word level, and the lowest scores at the t-unit, or sentence level of analysis. The paper concludes that voice recognition technology is still an at early stage of development in terms of accuracy and single-speaker dependency. Nonetheless, the fact that consistent results have emerged suggests that the development of an assessment tool, such as a reading aloud test via voice recognition technology and determining a score through an analysis of the output, may be a testing procedure with potential.</div>
</front>
</TEI>
<affiliations><list><country><li>Hong Kong</li>
</country>
<region><li>Hong Kong</li>
</region>
<settlement><li>Sha Tin</li>
</settlement>
<orgName><li>Université chinoise de Hong Kong</li>
</orgName>
</list>
<tree><country name="Hong Kong"><region name="Hong Kong"><name sortKey="Coniam, D" sort="Coniam, D" uniqKey="Coniam D" first="D." last="Coniam">D. Coniam</name>
</region>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001E69 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001E69 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:9E60D439B771A434534991EDCEAEA54E8210303B |texte= Voice recognition software accuracy with second language speakers of English }}
This area was generated with Dilib version V0.6.32. |